Rotten Apples or Bad Harvest? What We Are Measuring When We Are Measuring Abuse
نویسندگان
چکیده
Internet security and technology policy research regularly uses technical indicators of abuse in order to identify culprits and to tailor mitigation strategies. As a major obstacle, readily available data are often misaligned with actual information needs. They are subject to measurement errors relating to observation, aggregation, attribution, and various sources of heterogeneity. More precise indicators such as size estimates are costly to measure at Internet scale. We address these issues for the case of hosting providers with a statistical model of the abuse data generation process, using phishing sites in hosting networks as a case study. We decompose error sources and then estimate key parameters of the model, controlling for heterogeneity in size and business model. We find that 84 % of the variation in abuse counts across 45,358 hosting providers can be explained with structural factors alone. Informed by the fitted model, we systematically select and enrich a subset of 105 homogeneous “statistical twins” with additional explanatory variables, unreasonable to collect for all hosting providers. We find that abuse is positively associated with the popularity of websites hosted and with the prevalence of popular content management systems. Moreover, hosting providers who charge higher prices (after controlling for level differences between countries) witness less abuse. These factors together explain a further 77 % of the remaining variation, calling into question premature inferences from raw abuse indicators on security efforts of actors, and suggesting the adoption of similar analysis frameworks in all domains where network measurement aims at informing technology policy.
منابع مشابه
Estimation of Returns to Scale in the Presence of Undesirable (bad) Outputs in DEA when the Firm is Regulated
The calculation of RTS amounts to measuring a relationship between inputs and outputs in a production structure. There are many methods to measure RTS in the primal space or the dual space. One of the main approaches is using the multiplier on the convexity constraint. But returns to scale measurements in DEA models are affected by the presence of regulatory constraints. These additional constr...
متن کاملPresenting a Hybrid Approach based on Two-stage Data Envelopment Analysis to Evaluating Organization Productivity
Measuring the performance of a production system has been an important task in management for purposes of control, planning, etc. Lord Kelvin said :“When you can measure what you are speaking about, and express it in numbers, you know something about it; but when you cannot measure it, when you cannot express it in numbers, your knowledge is of a meager and unsatisfactory kind.” Hence, manag...
متن کاملDiagnostic and therapeutic challenges for dermatologists: What shall we do when we don’t know what to do?
What shall we do when we have done everything we could for the diagnosis and treatment of a patient, but were not successful? What shall we do when there is no definite treatment for a patient? What shall we do when we have no diagnosis or treatment for a patient? Some useful suggestions are presented here to get rid of these situations.
متن کاملMeasuring the overall performances of decision-making units in the presence of imprecise data
Data envelopment analysis (DEA) is a method for measuring the relative efficiencies of a set of decision-making units (DMUs) that use multiple inputs to produce multiple outputs. In this paper, we study the measurement of DMU performances in DEA in situations where input and/or output values are given as imprecise data. By imprecise data we mean situations where we only know that the actual val...
متن کاملAn Integrated Approach for Measuring Performance of Network structure: Case study on power plants
Data envelopment analysis (DEA) and balanced scorecard (BSC) are two well-known approaches for measuring performance of decision making units (DMUs). BSC is especially applied with quality measures, whereas, when the quantity measures are used to evaluate, DEA is more appropriate. In the real-world, DMUs usually have complex structures such as network structures. One of the well-known network s...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1702.01624 شماره
صفحات -
تاریخ انتشار 2017